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Abstract. A class of models describing the flow of information within networks via 
routing processes is proposed and investigated, concentrating on the effects of memory 
traces on the global properties. The long-term flow of information is governed by 
cyclic attractors, allowing to define a measure for the information centrality of a vertex 
given by the number of attractors passing through this vertex. We find the number 
of vertices having a non-zero information centrality to be extensive/sub-extensive for 
models with/ without a memory trace in the thermodynamic limit. We evaluate the 
distribution of the number of cycles, of the cycle length and of the maximal basins 
of attraction, finding a complete scaling collapse in the thermodynamic limit for the 
latter. Possible implications of our results on the information flow in social networks 
are discussed. 
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1. Introduction 

The structural and statistical properties of evolving and dynamical networks have been 
studied intensively over the last decade [1, 2, 3], due to their ubiquitous importance in 
technology, the realms of life and complex system theory in general [4]. Transmission 
of physical quantities like electricity and of information are key network functionalities, 
both in physical networks like the power grid and the Internet, as well as in relational 
networks such as social networks [5]. The basic transmission process takes place between 
two network vertices and two constituent vertices are linked by an edge whenever direct 
transmission is possible. 

Another key network functionality is routing. An incoming physical quantity, 
package or information, arriving at a certain vertex is forwarded by this vertex. This 
routing process may proceed either via static routing tables or via dynamical routing 
protocols. The latter is the case for the Internet, the internet servers having the task of 
routing information packages such that they find their way eventually to the addressees 
specified in the package headers. Here we specify a class of deterministic vertex routing 
models with static routing tables, viz with quenched routing dynamics. The routing 
tables are drawn randomly for every vertex and the models are characterized by the 
network topology on one side and by the length of the memory trace along the routing 
path on the other side. 

In this study we focus on the effect of the routing memory on the long-term 
dynamical properties of the routing process, considering the case of information routing. 
For this purpose we consider fully connected networks and two kinds of trace memory. 
In the first case memory is absent and the package is passed on irrespectively of where 
it came from, always along the same outgoing edge, see Fig. 1. In the second case the 
memory trace consists of a single time step, and the routing of incoming information 
depends on the vertex that routed it in the previous time step; for every incoming edge 
the routing table specifics a distinct outgoing edge. We study then the statistics of the 
resulting cyclic attractors, the basins of attractions and of a measure for the degree 
of information centrality. The vertex routing models are defined in the phase space of 
directed links and a given vertex is information central of degree c = 0, 1, 2, ... when it 
belongs to one or more intersecting attractors of the information routing dynamics. 

We find that a memory trace for the routing process makes a qualitative difference. 
In the absence of memory only a sub-extensive number 0{N^^'^) of vertices is information 
central in the thermodynamic limit iV — > oo. For the case of a one-timestep memory 
trace the number of information central vertices is on the other hand extensive, being 
linearly proportional to the number of vertices A^. 

The concept of topological-based centrality and its dependence on network 
properties has been widely studied [1, 2, 3]. The notion of information centrality used 
here is, on the other side, based on the observation, that the fiux of information the 
members of a social network receive is important. This fiux of information is maximal 
whenever a person is part of one or more attractors of the information routing process. 
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Figure 1. Examples of routing tables, (a) Information is always routed to vertex 5, 
independently of where it came from (memoryless model) (b) Information arriving at 
vertex 3 from the vertices 1,4,5,2 is routed to the vertices 4,5,2,5 respectively (model 
with memory trace). 



as it dispose in this case over the entire information generated in the respective basins 
of attraction. Members of a social network located on the fringe of the information flow 
will, on the other side, receive information only from a small number of other members. 

We note that the standard network characterization of real-world networks is 
provided in terms of network topologies [6]. We propose that there is a need to 
supplement the field data with information describing the dynamics of routings, which 
would allow to evaluate the possible social relevance of the information flux and 
accumulation. The vertex routing models considered here may, in addition, be regarded 
as a reference models, akin to the Erdos-Renyi model of graph theory [7] and to the 
iVi^-model of dynamical boolean networks [8], having well deflned and controllable 
dynamical properties in the thermodynamic limit. 

2. Vertex routing models 

Our models are based on the idealized notion, that a vertex T4 receiving information 
from a vertex Vj will transmit it to one other vertex only, say V^. A vertex routing table 
T then corresponds to the binary tensor Tj^^ = {T)ijk, 



where (jk) denotes the directed edge from vertex Vj to vertex Vk, compare Fig. 1. For 
every incoming link (jk) to the vertex Vk the information is routed to a single outgoing 
link (ki), thus 




no routing 

1 routing from (jk) to (ki) 



(1) 





(2) 
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Figure 2. Example of a routing table on a fully connected network with four 
nodes. There are three distinct cyclic attractors containing the directed links 
[(Uj, (43j, (32j, (2ljj, [(34j, (42), (23jj and [(12j, (24j, (4ljj respectively. For 
the nodes 1 and 3 the information ccntrality is c = 2, while for nodes 2 and 4 one finds 
c = 3. 



where zj. is the degree of the routing vertex V^. Considering here a fully connected 
graph with N sites we have Zk = N — 1. The entries Tikj = 0, 1 of the routing table are 
determined consecutively for all vertices: For a given vertex k and a given incoming edge 

{jk) one outgoing edge {kl) is randomly chosen among the N — 2 potential candidates 
of outgoing edges. Then Tj^j = 1 and Tn^j = 0, Wi ^ I. For the two models we make 
the following differentiation: 

• Without memory For every directed edge the entries Tikj are selected randomly, and 
are independent of the originating vertex, thus we can write Tikj = Tiki, Vi, j, k, I. 

• With memory Again all entries Tikj are drawn randomly. Routing of information 
depends on where it came from, but backrouting is not allowed: Tjkj = 0, Vfc, j. 

The rational for these two models in the context of social networks is the following: 
For the memoryless case a new information is passed on always to the best friend, 
irrespectively of the information source. In the model with memory the information 
routing depends on the source. Information received by a relative might be passed on 
to another relative and work-related news might be passed on predominantly to a work- 
place buddy. The suppression of backrouting is not important for large but clearly 
makes sense; It is never a good idea to echo a joke to the person which told it in the 
first place. 

An example of a routing table on a fully connected graph with four nodes is 
presented in Fig. 2. In a discrete phase space built upon the directed edges every 
cycle is an attractor, thus we have cyclic attractors. In this example we have three 
cyclic attractors (labeled with colors), each of which has a basin of attraction of volume 
V = 4, which is the number of directed edges. The states (13) and (31) in the phase space 
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Figure 3. The information centrality /(c, N) for the models with memory, viz the 
probabihty that c attractors pass through a given vertex, for N = 10, 20, . . . , 70 number 
of vertices (top-down/down-top for c = 1/c = 0). Inset: I{0,N) (red hne) and 1(1, N) 
(blue hne) as a fimction of the lines are linear fits. 



of the dynamics belong to the basin of attractions of the red and the green attractors, 
respectively, but they do not belong to any cyclic attractor. Also, for nodes 1 and 3 the 
information centrality c = 2, while for nodes 2 and 4, c = 3. 

We also note that the vertex routing models (1) play an important role in the 
context of neural cognitive information processing. In this context a vertex corresponds 
to an object and the sequence of vertices activated by the routing process to an 
associative thought process [9, 10]. In addition there is a close relation to random 
boolean networks [11, 12], with the directed links constituting the boolean variables. 
In terms of a boolean network the routing model operate in the sparse activity limit, 
since the routing problem deals with routing of individual packages, a single directed 
link being operative at any given time. 

3. Memory less model 

In this case the routing tensor Ti^j is independent of the last index and its dimension 
is effectively reduced to two. The probability distribution Ni{L, N) of finding a cycle of 
length L in a network with nodes is given by [see Appendix A] 

1 /'N\ D 1 

'^'(^•'^'-IMU)T(7v3T)r. (3) 

where z[N) is a normalization factor, (^) the number of L sites out of the A^ vertices 
and L\/L the number of possibilities to connect L sites into distinct loops. The factor 
(A^ — in (3) is the probability that any ordered set of L sites is connected via the 
routing table, with 1/(A^ — 1) being the probability of two vertices being connected. 

For the memoryless model the phase space reduces effectively to the number of 
vertices A^, compare Fig. 1, and the cycles are disjoint and non-intersecting, only c = 0, 1 
cycles may pass through any vertex in the memoryless model; any given vertex may 
have only an information centrality c = 0, 1. Defining by J(c, A^) the distribution of the 
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Figure 4. The distribution Na{n,N) of the total number n of cycles per realization, 
for N = 10, 30, 50, 70 numbers of vertices. Inset: The average {Na{Ti, N)), as a function 
of N, together with a power-law fit (solid line). 



information centrality we find [see Appendix B] 



for the memoryless model, where we have used (3) and (L) = Y^j^ LNi[L, N). 1(1, N) 
scales like 



I{l,N) ~ l/VN, N ^oo (4) 

in the thermodynamic limit. Only a sub-extensive number of vertices belong to an 
attractor of the information flow and essentially all vertices are excluded from the long- 
term flow, /(O, iV) ^ 1 for ^ oo. 



4. Model with memory trace 

We have performed extensive numerical simulations of the vertex routing model with 
a memory trace. We studied ensembles of Nreai = 10^ randomly drawn realizations of 
the routing tensor, considering fully connected networks having = 10, ...,70 sites, 
and performing an exhaustive search of all attractors and their respective basins of 
attractions. We evaluated the normalized distributions 

J(c,iV), Na{n,N), Ni{L,N), Vr'{v,N), (5) 

where /(c, A^) is the distribution of the information centrality, with c being the number 
of attractors passing through a vertex, Na{n,N), the distribution of the number of 
attractors n per network, Ni{L,N) the distribution of the lengths L of attractors and 
the distribution of the volume of the largest basin of attraction V^"'^{v,N). For every 
model realization all basins of attraction were evaluated and the statistics of the maximal 
volume is given by V^'""^'(f , A^), with the volume v being defined relative to the entire 
phase space Q = N{N — 1) of directed links. 
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Figure 5. The cycle-length distribution Ni{L, N) as a function of cycle length L, for 
N = 10, 20, . . . , 70 number of vertices. Inset: The median fia{N) of Ni{L, N), as a 
function of N. 



We find, see Fig. 3, that the information centrality approaches a well defined 
limiting function /(c) = limTv^oo -^(c, N) in the thermodynamic hmit. The availability 
of information is quite democratically distributed, only a fraction /(O) ~ 0.34 of vertices 
are cut-off completely from the long term information flow. 

In Fig. 4 the distribution of the number of attractors per network is presented. 
The average number of attractors per network increases only slowly with the size of 
the network, as {Na{n, N)) ~ A^"". Somewhat larger system sizes are necessary for 
a reliable estimate of the scaling exponent, our best fit (given in the inset of Fig. 4) 
indicates aa ~ 0.29. 

We encountered problems of undersampling of the space of all possible model 
realizations when evaluating the cycle- length distribution Ni{L, N), presented in Fig. 5, 
which is a phenomena well known in the field of random boolean networks [12]. For the 
case of the vertex routing model the probability of finding very long cycles could not be 
determined accurately, due to the fat tails of Nl{1,N). This problem affects also the 
results for the mean cycle length {Ni{L, N)), but not the median /ia(A^) of Ni{L, N). We 
found scaling close to a square-root law for the median (inset of Fig. 5), fia{N) ~ A^*^'^^. 

The number of cyclic attractors steadily increases with A^, as shown in Fig. 4. The 
question is then, whether there is typically a single dominating attractor, in terms of 
the size of the respective basins of attractions, or whether the phase-space volume is 
more or less equally divided between the attractors being present. This information 
is provided by V^"'^{v,N). The probability that the largest attractor volume is in the 
interval [fi, ^2] is given by Yll^=vi Vb^°'^{v, N). Here the volume is relative to the maximal 
basin of attraction, which is equal to the phase-space volume Q = N{N — 1) and v = 1 
occurs when only a single attractor is present. 

The rescaled f2V^'"°^(f , A^) converges rapidly with A^ to a limiting function, see Fig. 
6. There is a divergence for v ^ 1, due to the fact that the probability of finding an 
ensemble realization with a single attractor, Na{l, N), scales like 1/A^ and consequently 
Qymax(^^ = 1, AT) = QNain = 1,N) ^ N. A kink at i; = 1/2 occurs for Vf^'^'^iv, N). For 
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Figure 6. The rescaled distribution N{N — l)V™'^^{v, N) of the volumes of the basin 
of attractions, for N = 10, 30, 50, 70. The volume v is normalized with respective to the 
total phase space il = N{N — 1) of directed links. Inset: The integrated distribution 
/J'du'14™°^(w',7V), for N = 30,50,70 (the curves faU on top of each other). For 
comparison the result (6) for the case of two attractors with randomly distributed 
basins of attractions is given (dashed line). 

V > 1/2 ensemble realizations containing two or more cycles contribute to Vf^°'^{v,N), 
for V < 1/2 realizations with three or more attractors contribute. 

The divergence of QVfl/^"'^ {v , N) for f — > 1 seems to indicate that cycles with large 
volumes of attraction would have a dominating role, controlling most of the long- 
term information flow. In order to understand the origin of this divergence we have 
compared the integrated distribution dv'V^""^ {y' , N) (inset of Fig. 6) with an unbiased 
distribution of basins of attraction, taking the case of two attractors (dashed line in the 
inset of Fig. 6). In this case, when only two attractors are present, one of the volumes 
of attraction is always equal or larger than i7/2, and 

The integrated distribution of model with a memory trace follows somewhat the 
simplified model (6) of a random distribution of two basins of attraction, albeit with 
a substantial suppression close to unity, indicating that the divergence of VtV^"'^{v^N) 
for f — i> 1 is a directly related to the statistical properties of the distribution of basins 
of attraction, independent of the details of the dynamics. 

5. Discussion 

We have presented and analyzed a novel class of models suitable for describing the flow 
of information in complex networks. In these models the flow of information is realized 
by information packages travelling along the edges of a communication network, whereas 
it is assumed, in the intensively studied information diffusion models, that information is 
an attribute attached to the vertices and not to the edges. The dynamics is determined 
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in our class of models by routing tables, specifying respectively for every vertex the 
routing of incoming information packages. 

The two vertex routing models discussed in the present study constitute idealized 
reference models, not suitable for a direct modeling of field data. The key point of the 
present study is to analyze the non-trivial effects of a memory trace on the long-term 
information flow and not the dependence on the network topology, which is left for 
further studies. Information is conserved in the present models. This would be clearly 
not the case for real-world social networks but it holds for the network of internet 
routers, which have the task of routing packages of information without increasing or 
decreasing their number. 

Traditionally, the flow of information in social networks, like the random spreading 
of rumors, has been modeled by diffusion processes [13]. Searching for more realistic 
models the special topology of social networks has been discussed intensively [14], as 
well as corrections to the diffusion process itself [15]. The vertex routing and the 
diffusion models of information flow constitute two extremes. In the flrst model the 
direction of the information flow is 100% deterministic, in the second model 100% 
random. The information-flow occurring in real-world social nets is expected to be 
partially a random and partially a directed process. It will therefore be of interest for 
future studies to interpolate between these two reference models. We propose that field 
studies characterizing social networks should be supplemented by data describing the 
dynamics of the information flow, in addition to the standard structural and topological 
characterization, as the accumulation of information in the attractors of the information 
flow may have a substantial social impact. 

The dependence of the routing dynamics on the network topology is additionally 
an important issue for future studies. Cycles in the routing process can only then 
appear, when the underlying network topology allows for loops. Here we have been 
considering fully connected networks and loops of all length are present. For many 
classes of real-world networks there is a characteristic loop-length, which generally scales 
sub- extensively with the network-size [16]. Interesting interferences phenomena between 
this scaling and the sub-extensive scaling of the typical attractor-length (inset of Fig. 
5) may then be expected. 

Appendix A. The cycle length distribution for the memoryless model 

Let qt be the probability that a path remains unclosed after t steps. If a path is still 
open at time t, we have already visited t + 1 different nodes. There are t ways to close 
the path in the next time step. The relative probability is then pt = t/{N — 1). The 
probability of still having an open path after t steps is 




Qo 



qt = 




{N -iy{N -i-t)\ ■ 
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The average number of cycles of length L is 

qt=L-i N N\ 



{NiiD) 



N-1 L L{N - 1)^{N - L)\ ' 
where we used the following considerations [4] : 

(i) The probability that the node, visited at time t + 1, is identical to the starting node 
is 1/(A^- 1). 

(ii) There are N possible starting points. 

(iii) Factor 1/L corrects for the overcounting of cycles when considering the L possible 
starting sites of the L-cycle. 

After normalization we obtain Eq. (3) , the probability distribution of cycle lengths 

1 A^! 
^'^^'^^ " z{N) L{N -1)L{N -L)\ ' 

where z{N) is normalization factor 

^ ^L(JV-l)'(JV-L)! (N - 1)" (N - k)k\ ' 

L — 2 h — 

Appendix B. Connection between the average information centrality and 
the average cycle length 

We consider an ensemble of R random realizations of a routing tensor on a fully 
connected network with N nodes. Let be the total number of vertices which belong 
to at least one cyclic attractor, where a = 1 . . . R . 

The length of an cyclic attractor is equivalent to the number of vertices that belong 
to the attractor. In the case of only one existing attractor ria — ^^=r {^~'^)Qa{^-: 
where L is the length of the attractor and Qa{r, N) is the number of nodes which are 
repeated r times during one cycle of the cyclic attractor. If we denote the number of 
cycles of length L with Na{L, N), we can write, for the case of more than one co-existing 
cyclic attr actors, the following relation 

f^max C-max 

n„ = Y,LNa{L,N) - Y,{r - l)Qa{r,N) - ^(c - l)P«(c, AT) , (B.l) 

L r=l c=l 

where Pa(c, N) is number of nodes with information centrality c. For example let us 
consider the case when we have two attractors [(125, (23), (Si) ] and [(145, (45), (5l], 
(16), (67), (71)]. It is easy to count that there are 7 distinct vertices contained in this two 
attractors. We sec that only node 1 has information centrality c=2, thus Pa{c = 1) =6 
and Pq(c = 2) = 1. Also, node 1 is the only one two repeat two times during one cycle 
of second attractor, thus Qai'r = 1) = 6 and Qa{T = 2) = 1 . If we put this values into 
(B.l) we obtain = 7. 
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On the other hand, if we denote the distribution presented in Fig. 3 as I{c,N), 
where c is the information centrahty, then, in the case of one given reahzation of the 
routing tensor, we can write 

P^{c,N)^N-Uc,N) . (B.2) 

Also, total number of vertices, which are members of cyclic attractors, is then 

na^Nj2la{c,N) . (B.3) 

c=l 

Combining (B.l) and (B.2) with (B.3) we obtain 

c=l L c=l 

f-max 

-J2{r-l)Q^{r,N) , 

r=l 

After averaging over entire ensemble of R realizations of routing tensor and dividing 
both sides of (B.4) with A^, we obtain 



Cmax / 7vr\ -i '^max 



N ^ ' ' N 

c=l L r=l 



where we have used following relations 

R 



i^Ar,(L,Ar)=z(Ar).Ar,(L,Ar) , 

1 ^ 

-J2Uc,N)^I{c,N) , 



a=l 
1 ^ 

-J]g„(c,Ar)=g(c,Ar) . 

In the case of the memoryless model, I{c, N) have non zero values only for c = 0, 1, 
and it is not possible that one node is repeated more than once in one cycle of cyclic 
attractor, thus '^^i''^ ~ l)Q(^) A") — 0. Therefore, from (B.5) we obtain 

I{l,N) = '-^{L) , (B.6) 
where (L) = LNi{L, N) , which is the central result of this appendix. 
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